The Asymptotic Behavior of Minimum Buffer Size 
Requirements in Large P2P Streaming Networks 



Lei Ying R. Srikant Srinivas Shakkottai 

Dept. of ECE Dept. of ECE and CSL Dept. of ECE 

Iowa State University University of Illinois at Urbana-Champaign Texas A&M University 

Email: leiying@iastate.edu Email: rsrikant@illinois.edu Email: sshakkot@tamu.edu 



a^ ■ 

o : 

o . 

D - 

m 



C/2 



> 
m 
\o 

p 

G\ 
O 
OS 
O 



X 



Abstract — The growth of real-time content streaming over the 
Internet has resulted in the use of peer-to-peer (P2P) approaches 
for scalable content delivery. In such P2P streaming systems, 
each peer maintains a playout buffer of content chunks which 
it attempts to fill by contacting other peers in the network. The 
objective is to ensure that the chunk to be played out is available 
with high probability while keeping the buffer size small. Given 
that a particular peer has been selected, a policy is a rule that 
suggests which chunks should be requested by the peer from 
other peers.. We consider consider a number of recently suggested 
policies consistent with buffer minimization for a given target of 
skip free playout. We first study a rarest-first policy that attempts 
to obtain chunks farthest from playout, and a greedy policy that 
attempts to obtain chunks nearest to playout. We show that they 
both have similar buffer scalings (as a function of the number 
of peers of target probability of skip-free probability). We then 
study a hybrid policy which achieves order sense improvements 
over both policies and can achieve order optimal performance. 
We validate our results using simulations. 

I. Introduction 

Peer-to-peer (P2P) networks are rapidly becoming the 
medium of choice for content distribution over the Internet. 
Several studies have indicated their widespread adoption, and 
suggest that anywhere between 35-90% of Internet bandwidth 
is consumed by P2P applications [1], [2]. The initial applica- 
tion of the P2P idea was for file sharing, but as P2P file sharing 
networks have matured, many ideas have been transplanted to 
media streaming applications with a degree of success {eg. 
[3], [4]). As entertainment delivery over the Internet becomes 
increasingly main stream, P2P streaming is likely to assume 
an even greater significance. 

Streaming media using P2P has a more exacting set of 
constraints than file sharing since each chunk that constitutes 
the streamed file must be received within a certain deadline 
in order to allow for smooth sequential playing out of the 
media. A natural approach for designing streaming networks 
that inherently possesses this sequential nature of streaming 
is to create a multicast tree amongst the source and users 
either at the IP or higher layer [5]-[7], and to push chunks 
on the tree. However, this approach often needs significant 
infrastructural overheads to perform adequately; for example, 
an IP layer multicast requires support by the routers in the 
network. Another issue is that when end-users act as the peer 
nodes in the multicast tree, the churn caused by users arriving 
and departing causes significant inefficiencies [8]. 



A more popular approach to P2P Streaming is using appli- 
cation layer multicast with a full mesh topology among peers. 
The idea is to maintain a playout buffer, and to pull chunks 
into the buffer by communicating with a random selection 
of peers. This approach bears a strong resemblance to the 
BitTorrent technique [9] of selecting peers from a full mesh in 
order to exchange chunks [9]. However, the sequential playout 
constraint requires that the chunk to be played next is available 
when required. Several systems exist that use this approach to 
content streaming, and notable examples are CoolStreaming 
[10] that was one of the first systems to employ this approach, 
and some that are highly popular in East Asia such as PPLive 
[3], QQLive [4] and TVAnts [11]. 

In this paper we study optimal algorithms for mesh-based 
P2P multicast. Here, a server generates chunks at constant 
rate, and randomly selects on peer to push it to. Other 
peers select some other peer at random, and request some 
chunk in the selected peer's possession. The objective is to 
ensure that the chunk that needs to be played out is available 
with high probability. Under this paradigm, a distribution 
algorithm essentially answers the following question: Given 
that a particular peer has been selected, which chunk from 
that peer should be pulled'? The answer is not obvious, since 
on the one hand pulling the chunk with the shortest playout 
deadline might be optimal in the short term, but might yield 
poor performance as far as the rest of the peers are concerned. 
On the other hand, pulling a chunk that is farthest from playout 
would give maximum amount of time for its dispersal amongst 
peers, but this might be at the expense of the chunk that is most 
urgently required. It is also clear that the question cannot be 
answered independently of the buffer size, since a large buffer 
size gives more time for chunk dispersal than a short one. 

There has been considerable interest in P2P systems for both 
file delivery and streaming. Analytical models and studies of 
the achievable limits of P2P file delivery are presented in [12]- 
[16]. The primary objective of these studies is to ensure that 
all interested peers obtain the entire file with as short a delay 
as possible. Our current work uses some of the techniques 
used in these papers, but since we focus live streaming, the 
objective is somewhat different due to the sequential nature 
of playout. There have also been analytical and simulation 
studies of tree and mesh based P2P streaming [8], [17]. In [8], 
mesh versus tree approaches are considered primarily using 
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simulations, whereas in [17] upper bounds on performance 
are developed. Tree-based approaches are considered in [5]- 
[7], but since we consider end-user P2P, we focus on a full 
mesh pull based approach. The study in [18] considers using 
BitTorrent to directly assist in live streaming. Finally, the 
authors of [19], [20] consider a general class of streaming 
algorithms. Their main contribution is to develop an analytical 
model of P2P real-time streaming algorithms, and numerically 
show that based on available server capacity, different hybrids 
of greedy and rarest-first policies perform better than either 
of the constituent policies. Our work develops on the model 
introduced in [19], but our goal is to analytically answer the 
following question: how much buffer size reduction can be 
achieved using a hybrid policy? We are interested in this 
question since a large buffer size increases start-up latency 
and adversely impacts the "real-time" nature of the system. 
The model in [20] is slightly more accurate than the one in 
[19], but the model in [19] is more tractable which allows us to 
obtain design insights which are confirmed by our simulation 
results. 

Main results 

The system that we consider consists of M users who are 
all simultaneously interested in a real-time content stream 
generated by a server. The stream consists of chunks with one 
new chunk generated at each discrete time instant, and the 
server selects one peer at random for each new chunk. Peers 
obtain chunks either if they are selected by the server, or by 
full-mesh P2P with random peer selection. Peers maintain a 
buffer of size m, with the chunk in the mth location played out 
at each time instant if available. We have a target of skip-free 
playout probability of q over all peers. 

We present an analytical characterization of the scaling of 
required buffer size m with M and q under different policies. 
The main results of of our analysis are summarized below: 
> We consider the rarest-first policy wherein priority is 
given to the chunks farthest from playout. In other words, 
a peer picks that chunk in the difference set with his 
selected peer that is closest to the first buffer position. 
We show that given any target probability 0.5 <(?<!, 
the buffer size to attain this target probability with the 
rarest-first policy scales approximately as 

log(A/)+log(2g-l) + ^^^^. 

• We then consider the greedy policy wherein priority is 
given to the chunks closest to playout. Thus, a peer picks 
that chunk in the difference set with his selected peer that 
is closest to the mth buffer position. We show that given 
any target probability q > jj, the buffer size to attain this 
target probability with the greedy policy approximately 
scales in a similar fashion as 

log(A/) + log(g) + 

1-9+ M 

• We develop a hybrid policy which combines the greedy 
and the rarest-first policies. The policy uses rarest-first up 



to a buffer position where the probability of occupancy 
is greater than e, and switches to using the greedy policy 
from there on. For this policy, we show that there exist 
constants and a^ gg, independent of AI and q, such 
that, if the buffer size m satisfies 

m > alog{M) + ae,e2 log ' 

then the skip-free playout probability P[h,_,m){rn) > q. 

Thus, the buffer size required by this policy less than 

that required by either of the two policies it is composed 

of in an order sense. 
Above, log(.) refers to base 2. Consider an example scenario 
where we have M ~ 10, 000 and a target probability of skip- 
free playout q = 99.9%, we see that both the rarest-first and 
greedy poUcies would need a buffer size of the order of 1000s, 
whereas the hybrid policy would need a buffer size only of 
the order of 10s. We contrast the results with a straightforward 
lower bound. Given a target probability -p- < 9 < 1 , the buffer 
size to attain this target probability under any policy should 
be at least log M, which for our example is also of the order 
of 10s. Further, for a target of (7 > 1 — 1 /AI, the hybrid policy 
attains this lower bound in an order sense. 

Finally, we show using simulations that the order sense 
reduction in buffer size promised by the hybrid-policy do 
indeed materialize, particularly when the desired target of 
buffer occupancy is high, so arguing for the adoption of such 
hybrid chunk selection poUcies in P2P streaming systems. 

II. Background and Model 

We first introduce the model of a P2P content streaming 
system developed in [19] under a large-system assumption. 
According to the model, time is considered to be slotted 
and is synchronized across the whole system. The system is 
illustrated at some time slot in Figure [T] and consists of a 
single server and AI peers. The server is a source of real-time 
content such as media coverage of a live event, and generates 
one chunk of new data per time slot. The server has a limited 
communication capacity, and is able to transmit this chunk to 
exactly one peer. It does so by choosing one of the Ad peers 
at random. Each peer is assumed to have a buffer of size m, 
with the buffer positions indexed by i G {1, ..,771}. As shown 
in Figure [T] in each time slot they attempt to playout the chunk 
at buffer position m, causing a rightward shift of the buffer 
contents by one position. Since the data stream is real-time, 
peers are synchronized and they all attempt to playout the 
same chunk of data; in the figure the chunk labeled A is to be 
played out by all the peers. If the buffer position m is empty, 
there is a gap in play out, and the missing chunk is never 
recovered (i.e., the rightward shift occurs even if the chunk is 
missing, and peers remain synchronized). Hence, the peer at 
the bottom of Figure [T] will never obtain chunk A. 

Peers are part of a P2P network, and they can potentially 
obtain any chunk from each other until the point of playout 
of that chunk. Thus, chunks B through O can be obtained 
through P2P chunk sharing. Our P2P network model is a fully 



3 



1 2 



Buffer position rn 



H D B 



Server 
(selects 1 peer)^ 



I P2P I 



L I G E C 



— ^ Playout 



• Playout M Peers 



NML 



P2P j 

j1 fIeIdI 



' Playout 



Fig. 1. A P2P content streaming system. Content is generated real-time at 
server, and each chunk is sent to one of M interested peers. Peers try to 
obtain chunks via P2P among themselves. Chunks are shifted to the right by 
one position at each time step, and the chunk at position m is played out if 
available. We would like to ensure that all peers possess the chunk at position 
m with high probability. 



connected graph. Each peer chooses one peer at random, and 
may request a chunk from it. There are no restrictions on 
upload and download bandwidths, since with random peer 
selection the number of peers selecting any one peer is finite 
with high probability (so the results would essentially be the 
same even with such bandwidth constraints). Suppose that a 
peer has a set of chunks Q and its selected peer has a set of 
chunks TZ. Define A = TZ \ Q. Then a chunk selection policy 
/i is a rule by which the peer selects one of the chunks in A. 

The model in [19] is an equilibrium model, i.e., it studies 
the system assuming that it is in steady-state. We assume 
that the buffer occupancy probabilities have a steady-state 
distribution where the steady-state probability that buffer space 
i is occupied is denoted P{p,.m){i)- We assume that this 
distribution is identical and independent across peers. As we 
will see shortly, under the independence assumption, it is 
analytically tractable to characterize the impact of the rest 
of the system on a single peer This is similar to mean-field 
approximations in physics where, in a large system, the impact 
of the rest of the system on a single particle is captured by a 
"mean field." In our case, the large-system assumption means 
that M is assumed to be large. 

Under the above assumption, we can write down a simple 
relation between the steady state probabilities (at the beginning 
of any time slot) of buffer occupancies as follows: 

+ 1) =P(M,m)(*) 

V i > 1, (1) 

P{p.,n){l) = Yf (2) 

In the above, since buffer position i + 1 is filled by a rightward 
shift from buffer position i, its steady state probability at the 
beginning of the current time slot is the probability that i 
was already filled at the beginning of the last time slot, plus 
the probability that i was filled by P2P methods during the 
last time slot. The latter term is derived by considering that 
P(/x,ni)(*)(l '~P(fj.,m){i)) is the probability that a peer does not 
possess i but the selected peer does, and 
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where A is the difference set between the peers as defined 
above. In other words, is the probability that a peer 

TT chooses to download chunk i from its selected peer A, given 
that TT does not possess chunk i while A does. This selection 
probability is a function of the chunk selection policy fi. 
As mentioned earlier, a number of independence assumptions 
have been made in arriving at the above model; see [19] for 
an informal justification. 

Two policies that are of particular interest are the following: 

1) Rarest-first: Under this policy (denoted by r), priority 
is given to the chunks that have the lowest steady- 
state probability. From ([T]i for any policy p, we have 
P{^L,m){j) < P{f,,rn){i) for j < i. Hcncc, rarest-first is 
equivalent to selecting chunks with priority 2 > 3... > 
m — l. An interesting result of [19] is that for this policy 
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P(r,m)(0- 



(3) 



2) Greedy: Under this policy (denoted by g), priority is 
given to the chunks that are closest to playout. From ([T]i 
this is equivalent to prioritizing those chunks that have 
the highest steady state probabilities. Hence, the greedy 
policy selects chunks with priority m— 1 > to — 2... > 2. 
It is shown in [19] that for the greedy policy 

= J] - P{g,m){m) + P(g^„r){i + 1). (4) 

The objective is to find a policy p that has the smallest value of 
TO for a target value of „j)(m). In other words, we would 
like to find a policy that requires the smallest buffer size for 
a desired probability of skip-free playout. 

III. Insights from a Fluid Approximation 

Our first objective will be to obtain insight into the perfor- 
mance of the chunk-selection poUcies by approximating the 
difference equations in the previous section by differential 
equations. The resulting model is called the fluid model. 
While the fluid model is not precise, the main purpose of 
this model is to provide key intuition which will serve as 
the basis for our analysis in the next section, where will 
rigorously prove all results in the next section by directly 
working with the difference equations. Let us first consider 
the expression describing the steady state probabilities in ([T]). 
We can approximate this system of difference equations by 
using differential equations in the following manner. 



dp(i^,m)ii) 
di 



(0) 



(6) 



--P(M,m)(«)), (5) 
1 

M' 

Consistent with the fluid approximation, when applying the 
above equation to the greedy policy, we will replace pi^i by 
Pi in the expression for S(^g „i){'i-) given in (|4|. 

In this section, we will provide a heuristic explanation of 
the main ideas in the paper by using the above fluid model. 
In the later sections, we will use this intuition to derive our 
main results directly from the discrete model. 



4 



A. Buffer sizing for the rarest-first policy 

We first use the fluid model to study the minimum buffer 
size required to achieve a given probability of buffer occu- 
pancy. We have the following result. 

Fluid Result 1: The buffer size m required to attain 
^(^.^^(m) = q using the rarest-first policy is 9(logA/) + 
9(1/(1 

Proof: From ^ and (O, the dynamics of the rarest-first 
policy are given by 

dp{r,m)ii) 



di 



?'(r,Tn)(«)(l -P(r,m)(«)) 



Solving the above differential equation using condition (|6]l 
yields 



di 



P{r,ni) 1 - P(r,ni) (1 - P(r,m))^ ' 

^ = ln(p(.,™)(OM)-ln(i^^i^) + 



1 



1 



1-P(r,m)(i) l-ir' 

Thus, for a target P(^r,m) (™) = the required buffer size is 
given by 



m = ln{qM) — In 



i-q 



The desired result follows. ■ 

B. Buffer sizing for the greedy policy 

We have the following upper and lower bounds on the 
minimum buffer requirements for the greedy algorithm. 

Fluid Result 2: The buffer size m required to attain 
P{g m){m-) ~ q using the greedy poHcy is 0(logi\/) + 
0((l/(l-<7))log(l/(l-g))). 

Proof: From (HI that, with P(g.m){m) = q, we have 

S{g,m){i) = 1 ^ ^ ^ ]g +P{g,m){i + 1) > 1 ^ 9, (7) 

where the inequality follows since P(g.m) (*) is increasing and 
P(g,m){^) = i/AI. Hence, the solution pg.m to 

dP{g,m} 



di 



S(g.j7i) 



is greater than the solution Pg^m to 

dP{g^,n 



di 



(1 - '?)P(g,m)(j)(l -P(s,m)(«))- 



The latter equation can be solved as follows: 

dP{g,,n) (i) 



di 

(1 - q)di 
{l-q)t 



(1 -9)P(g,™)(i)(l -P(s,m)(i)) 
dP{g,,n}{i) 

P(s,m)(*)(l -P(g,m)(i)) 

pm~pm 



In 



(l-p(z)MO) 



(8) 



'We use the notation 0(log M) + 0(1/(1 - q)) to denote that, with M 
fixed, the asymptote behaves like 0(1/(1 — q)) and, with q fixed, it behaves 
like 0(logJ\/). 



Using P(g,™)(0) = 1/M, pi^g^rn){m) = q and setting i = m, 
the above yields 



1-q (1 - q)/M ' 

which gives the desired result. ■ 
Fluid Result 3: The buffer size m required to attain 

P(g m) ('7^) = q using the greedy policy is ri(log M)+Vl{l/ [1 — 

q))- 

Proof: Fix a e (0, (<z-l/M)/(l-g)). Since p(g,™)(0) = 
1/M and P{g^m){'m) = q, there must be some k £ (0,m) for 
which p(g „i-) (fc) = 1/M + a{l — q). Then for all i < k, since 
p{i) is an increasing function, we have the following upper 
bound on S(g ,„) (z) : 

s{l) = l-q-l-+ p{i) < 1 _ 5 _ _L + = c{q, M), 

where 

c(q,Af) ■.= {l + a)il-q). 
Thus, the solution to 

dp{gjy 



di 



s(OP(s,rn)(0(l -P(s,m)(i)) 



is upper bounded by the solution to 

dP{g,m) 



di 



c(q, M)p(g_™) (i) (1 - p(g_,„) (i)). 



Solving the second differential equation above yields 

, p{i){l-p{0)) , 
(1-M*))p(0) ^ 

Letting i ~k and substituting p(A;) = 1/71/ + q(1 — (jf), gives 



(9) 



m > k 



1 ^^l/M + a{l-q) 



c{q,M) 
I 



In- 



(l/M) 
1 - 1/M 



c{q,M) 1-1/M ~a{l-q) 

It is easy to check from the above expression that m is 
il{\ogM) when q is fixed and $1(1/(1 — g)) when AI is fixed. 

■ 

C. A hybrid policy 

Our conclusions from the previous two subsections are: 

• The buffer size requirement for both the rarest-first and 
greedy policies have logarithmic scaling in the number 
of users 8 (log A/). 

• The buffer size requirement grows at least as 1/(1 — g) 
for both policies when the desired skip-free playout 
probability is q. 

This suggests that the buffer size requirement could be high 
if very stringent QoS is required, i.e., a skip-free playout 
probability close to 1 is required. In the next sections, our 
goal is to understand if there is a different policy that can 
provide significant reduction in the buffer-size requirement. It 
has been observed in [19] that the buffer-size requirement can 
be reduced by a using a hybrid policy which uses the rarest- 
first policy under certain conditions and the greedy policy 



5 



under certain conditions. Since both the rarest-first and greedy 
policies have similar asymptotic performance, we now use the 
fluid model to get some insight into when one policy performs 
better than the other, which would be helpful in designing and 
analyzing hybrid policies. From Q we have that for the rarest- 
first policy 



(10) 



whereas for the greedy policy with a target P{g.m){'n^) = 1^ 
from © 



Hg,m){i) = P(g,m)(« + 1) + 1 - 9 - 1/A^ 
~ P(g,m)(«) + 1 - <? - lA^, 



(11) 



where the approximation is motivated by the fluid model. 
Since P(r,m)(l) = P(3,m)(l) = 1/^^, for M > 2 and 
small (1 - g), S(g ,„)(!) < S(r,m)(l)- Thus, 

P(r,m) (2) > 

P(-g „j)(2). By induction, it follows that, for all j such that 
P{r,m){i) < 0.5, we have S(g,m)0') < S(^^,„)(j), which im- 
pUes P(r,m){j + 1) > P{g,m){j + !)• This suggcsts that, for the 
lower buffer positions, larger buffer occupancy probabilities 
can be obtained by using the rarest-first policy. 

Next, let us consider the higher buffer positions, i.e., 
the ones numbered m,m — 1,.... Suppose that both the 
rarest-first and greedy policies exactly achieve P(r.m)("^) = 
P(g,m){'ni) = q- Then, from dTol i and (fTTI) . it follows that 
■S(g,m)(™) > S(^r.m){Tn)- Duc to the monotonicity of both 
P(r,m)(*) ™d P[g^m){'i')i it follows that there exists a k such 
that S(^g^^){i) > S(r.,n)(j) for all i > m. When (1 — q) 
is small and M is large, such a k would correspond to 
P{g.m){k) = 0.5. In the fluid model, since both P{g_m){i) 
and P(r.m){'i) vary continuously as functions of i (which is 
also a continuous variable in the fluid model), we make the 
following observation: to allow for the largest increase in 
buffer-occupancy probabilities, use the rarest-first policy till 
the buffer position where the occupancy probability is 0.5 and 
the then switch to the greedy policy. While this policy may 
not be optimal, the fluid model suggests that this may be a 
good heuristic to combine the rarest-first and greedy policies. 
In the next section, by directly working with the discrete-time 
model, we show that such a hybrid policy leads to significant 
reductions in the buffer size requirements for skip-free playout 
and is also optimal in an asymptotic order sense. 

IV. Discrete Time Model: Lower Bounds on the 
Buffer Size Requirement 

In this section, we obtain lower bounds on buffer size 
requirement. We first introduce two notations that will be 
extensively used in our analysis. We denote by ?^(^,rn);-(j the 
largest index i of the buffer spaces such that P(p^m){i) < 
and ?T.(;^,,n);+(j the smallest index i such that P(p,rn)(*) ^ 
i.e.. 



{i ■ P{ii,,n){i) < q} 
in{i ■■ P(^,^yn){i) > q} ■ 



n(i^,m):-q = max 
n'(f,.m);+q mm 
It is easy to see that ri(,,,„):_g < < + 1. 



We first provide a simple lower bound on the buffer size, 
which holds for all chunk selection policies. 

Lemma 1: For any jj < q < I, any chunk selection policy 
/i and any buffer size m, the following inequality holds 

n{f,,m);+q > log A/ + log g. 



Proof: First, we know that 

P(M,m)(j + 1) 
= Pi^.m) U) + Pi^.m) ( j) (1 - (j)) S(^,m) («) 

= P(M,m)(j) (1 + - P{^,,m}{j))S{^,,„^){i)) 
< 2p(^,„)(j). 

Since p{0) = 1/A/, it follows from a simple induction 
argument that 

P(M,m)0' + l) < J^- 



Thus, 



> log M + log q. 



A. Rarest-first policy 

In this subsection, we obtain a lower bound on the buffer 
size required by the rarest-first policy. 

Theorem 2: Given any target probability 0.5 < q < 1, the 
buffer size required to achieve this target probability with the 
rarest-first policy is at least 



log A/ + log(2g- 1) 



logg-log(2g- 1) 
log(l + (2-2g)2) 



1. 



Proof: Recall that S(^r,m){i) = 1 — P{r,m}{'i) and 

Pir,,n){j + 1) =P{r,,n){j) + P(r,,n){j) (l " P(r,Tn) (j)) ^ • 

The above difference equation is difficult to solve, so we 
first lower-bound the number of buffer spaces to reach an 
occupancy probability of {2q—l) and then further lower-bound 
the additional buffer space required to reach an occupancy 
probability of q. 

First from Lemma [T] we have 



1) >logA/ + log(2g-l), 



(12) 



which is a lower bound on the buffer size required to reach 
an occupancy probability 1 — 2(1 — g) = 2q — 1. Next, we 
bound n(r,m);-(2cj-i) ~ ^(r,m);+(} (the additional buffer space 
required to increase the occupancy probability from 2g — 1 to 
q). Under the rarest-first poUcy, for any j > n(r,m);+(2g-i)j 
we have 

P(r,m)(j + 1) = P{r,m){j) + P{r,,n)iJ) - P{r,,n){j))^ 

< P(™)(j)(l + (2-2g)2). 

Thus, 



P(r,m)('^(r,m): + (2g-l) + j) 



2\0 



< P{r,-m){n{r^yn); + (2q-l)) (l + (2 - 2qf') 



(13) 
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To obtain an upper bound on P(r,m)("(r,m);+(29-i))j we 
know that 

P(r,rn)('T'(r,m); + (2g-l)) 
= P{r,m)in{r,m);-(2q-l)) + 

P(r,m)('^(r,m);-(2q-l)) (l ^ P(r,jri) ("(r,m); - (2g--l) )) ■ 

It is easy to verify that a; + a:;(l — a;)^ is an increasing function 
in X for a; > 0, so we have 

P(r,m)("-(r,m); + (2g-l)) < (2q - 1)(1 + (2 - 2g)^). 

Substituting into inequahty (fT3T l. we obtain 

P(r,m) {n^r,m); + i2q-l) + j) < (2? - 1) (l + (2 - 2q)^y~^^ . 

Now we can conclude that 

q < P(r,m){n(r,m);+q) 

< (2g-l)(l + (2-2g) 
which implies that 

logg - log(2(7 - 1) 



where the first inequality holds because 

P(s,m)(« + 1) < P(g,m)('^(g,™);-^ + 1) 



Thus, 



<P(,„,(l)(l+(l-J,)(l-,+ A))'V™.:.*-\ 

which yields 

1 

> ; : ^ + 1 



> 



1 



1. 



N2\"(r,,Ti); + ,-n(r.m); + (2g-l)+l 



Recall P(g_m){'i + 1) < 2p(g „i)(i) holds for all i. It is easy 
to verify that 

n(g,my,+q - nf^g „^y _A_ > log M + log g - 2. 



> 



- 'H-"0; + (2?-l) - (1 + (2 - 2g)2) 

Lemma [T] is obtained by summing inequalities (fTZl i and ( fT4] i. 

■ 

Remark: Note that when S is small enough, we have 
log(l + 6)^6. So 

logg-log(2g-l) _ -(l-g) + 2(l^g) ^ 1 
log(l + (2-2g)2) ~ 4(1 -g)2 2(1 - g) ' 

which implies that the buffer size required by the rarest first 
policy is fi(logM) + 

B. Greedy policy 

In this subsection, we characterize the buffer size required 
by the greedy policy. 

Theorem 3: Given any target probability q > jj, the buffer 
size to attain this target probability with the greedy policy is 
at least 

log A-/ + loggr - 1 + ^ 

log(2-<z+^) 

Proof: Recall that S(g^,„)(i) = 1 - ig - P{g,m)(.m) + 
P(g.m){i + !)• Under the greedy poHcy, 

P(S,m)(« + l) = P(3,m)(i) +P(s,m)(«)(l --P(s,™)(*)) X 

1 



- 1. (14) Note that n 



(9,™);-T^ ^ because 
P{g.m){n(g^jn)-+^) < "o" ' ^° ^c Can conclude that 

n{g,m);+q > ^ "-(g.m);-^^ + 

> log M + log 9 - 1 ^ 



Remark: When 1 — g and jj are sufficiently small, we have 



1 



1 



2_ ■ 

M 



1 - -J'(s,m)(H +P(g,m)(« + 1) 



Note that 



P{9,mMg.n);+^) < JJ 



since P(^g_m){i + 1) < 2p(g,„)(i) holds for all i. Now if 
P{g.m)['>n) > q, then for any i < n(g ,„)._^, we have 

P(3,m)(« + 1) 
< P{g.m) (j) + P(g^m) (^^ - JJ^ ^ ^ Jl 



log(2-g+^) 1-q 

According to the theorem above, when I — q > jj , the buffer 
size needs to be at least log M+ ■2(i-q) ' '^^'^ when l — q < p-, 
the buffer size needs to be at least log AI + 

From Theorem ID and Theorem [5] we can see that either the 
rarest-first policy or the greedy policy, if used along, require a 
buffer with size ^l{logM) + U{j^). Thus, if we have a target 
probability p{n) = 99.9%, the buffer has to have at least 1000 
chunk spaces. However, from the analysis, it is not difficult to 
see that under the rarest-first policy, the occupancy probability 
increases slowly when close to q; and under the greedy policy, 
the occupancy probability increases slowly at the initial stage. 
Similar observations can be found in heuristic argument and 
the simulations in [19]. This motivates us to consider hybrid 
policies that use the rarest-first policy on the buffer spaces with 
small indices and the greedy policy on the buffer spaces with 
large indices. We will demonstrate in the next section that a 
properly designed hybrid policy only requires 6 (log buffer 
size for any target probability q such that q> 1 — jj. 

V. Discrete Time Model: Hybrid Policy 

The insight obtained on hybrid policies from the fluid model 
of Section |lll] was "use the rarest-first policy till the buffer 
position where the occupancy probability is 0.5 and the then 
switch to the greedy policy." In this section we will analyze 
this policy using a discrete model and show that this insight 
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is indeed valid and achieves the minimum buffer size (in an 
order sense) required for a given miss probability. We first 
formally define the policy. 

Hybrid Policy: Let denote a policy, where e is the 
occupancy probability at which we switch from rarest-first to 
greedy. In other words, priority is given to buffer positions 



1 to 71 



with priority decreasing from 1 



(rarest first). If the selected peer has no chunks in this set. 



then requests are made for chunks in position ntr. 



1 to 



m, with priority decreasing from m '^^(r, );+e + 1 (greedy). 
The chunk selection function that determines the policy is then 
formally defined as follows: 

• Considering any i such that i < n(r,.);+e, the policy 
satisfies 

S{h,,7n){A;i) = 1 
if i e ^ and ^flll' 1} = 0, and 

S{h,,7n){A;i) = 

otherwise. 

• Considering any i such that i > ni^r. ■)■,+(! the policy 



satisfies 



if i e A and Af]{l, 
and 



S(h,,m){A;i) = 1 

'^(r,m);+e): * + 1, 
{h,,7n)iA;i) = 



,m} = 0, 



otherwise. 

We now analyze this policy and show that the buffer size 
requirement of this policy is optimal in the order sense. 

Theorem 4: Under the hybrid policy , if we have a target 
occupancy of the final buffer space q > 0.8, and the switching 
parameter e = 0.5, if 



> 



1 



log 



1 - S 



l-S(l-C2-q-S) 



log- 



log2Me 



log 1-9 



log(l + (l-e)2)' 



then mifn) > Q- Here, S ~ 0.8; a 
ei(l + a-6i)2);ei=e(l + (l-e)2). 

Proof: The proof is in three parts. We make the suppo- 
sition that Phc,m{'m) < q, and prove that it results in a lower 
bound on the buffer size m that violates our assumptions. 
Step 1. In the first part, we calculate the buffer size required 
for the rarest first policy to reach an occupancy probability 
that exceeds e > 0. Formally, we will determine the value of 
which for simplicity of notation within the proof, we 
will denote by n^^. Recall from dU and (|3]l that since we use 
the rarest first policy for buffer positions i < n+e — 1, the 
occupancy probability of buffer position i is described by 

+ = P(r,)(*)(l + (1-P(r,)(*))') (15) 

> p(,,)W(l + (l-e)2) 

> p{l){l + {l-ef)\ 



where we note that p{l) = 1/M is independent of the policy 
used. Thus, 

P(.,)(n+,)>Kl)(l + (l-ef)"^^ 

logMp(^^.)(n+,) 

^"+^-1 TTTTT ^2^• ^^^^ 

log(l + (l-e)2) 

Also we have from ( fTSl l that 

P{r,-){n+e) = 

P(r.){n+e P(^r.){n+e - 1))^) 

<e(l + (l-e)2) <2e, (17) 

where the last line follows from the fact that a:(l + (1 — x)^) 
is an increasing function. Finally, from (fTST i and (fTTT l. we have 



log 2Me 



log(l + (l-e)2)- 



(18) 



We next characterize the performance of the greedy part of 
the hybrid policy. 

Step 2. We first determine the buffer space required for the 
hybrid policy to exceed an occupancy probability S > e. In 
other words, we calculate 7i(^hc.m);+s^ and similarly charac- 
terize n(,j^ ,„)._5 = n(^h,.m}-+5 - 1- Again, for simplicity of 
notation within the proof, these will be denoted as n+s and 
ri-s, respectively. Consider the set of buffer positions j that 
satisfy n+£ + 1 < .7 < n^s- From Lemma |5] (see Appendix) 
and using a = P{h,,m){T^+e + 1) for convenience, it is easy to 
see that 

V(h,.rn){:i + 1) = 

(j) + P(h„ra){i){^ - P(h,.m){i)) 
X (l - a - Pi^h^.m^im) + P(h,.m){j + 1)) (19) 
>P(h,,m){i) + 

e(l - 6){l - a-]5(h,,m)(?7l) + 1)). 

Now, choosing < q, we have 

P(h„,n){j + 1) > 

P(h„m){i) + e(l - - a - q + P(h,,r,i){j + 1)) 
^ (1 - a)p(^h..m){j + 1) > P{h,.m){j) + a(l - a - q), 

where a = e(l — S). The above inequality allows us to 
recursively calculate p(^h^ m){n'+e + k) for A: > 1, and we 
will use it in order to determine n_5. We have 

(1 - a)p(,,^,„)(n+£ + 2) > p(h,.-m){n+e + 1) + a(l - a - q), 
(1 - a)p{h,,m){n+^ + 3) > + 2) + q;(1 - a - q). 

Hence we have 

(1 - afp(h,,m){n+e + 3) > 

(1 - OL)p(h,,yn){n+t + 2) + a{l - a)(l - a - g), 
> P(h,,7n){n+t + 1) + q:(1 - a - q) + a{l - a){l -a-q). (20) 
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fc-2 



We can generalize the above as 

{1 - a)''-^P(^h,,m){n+e + k) 

> P(/i,,m)(n+£ + 1) + a(l - a - q)^a{l - af 

1=1 

= a + (1 - a - - a) (1 - (1 - a)''-^) , 

where we have used our definition a = P{h,.7n){n+e + !)■ 
Choosing q > e = 0.5 implies that a + q > 1, which in turn 
means that 1 — a — q < 0. Then since (1 — a)(l — (1 — a)'''"^) < 
1, we have the relation 

(1 - a)(l - a - - (1 - q)'^'"^) >l-a-q. (21) 

From ( |20] | and (|2TI) we then have 

(1 - a)*^-V(h.,m)("+. + fc) > a + (1 - a - = 1 - (7. (22) 

We are now in a position to obtain a bound on n^s- Let k be 
such that n+e + fc = Also, by definition, (ri-^) < 

5. Using this fact, we see from ( |22] | that 



(5(1 -a)'^-^ > 1-g 



A; < 1 



1 



log 



1-q 



log(l — a) S 
In summary, we have now established that 



log—) 



1 1 ^ 

n — 



(23) 



where a = e(l — 6) and we chose e = 0.5. 
Step 3. Our final step is to characterize the buffer size 
required to take the probability of occupancy from 5 to our 
target Ph^,m{ni) > q. We first obtain an upper bound on 
Ph^,m{n+e + ^) that will be used in our analysis. Recall that the 
rarest-first policy is used up to and including buffer position 



This means that from (fTsl l we have 



P{h,,m){'n+e + 1) =P(r.){n+e) (l + (1 - P{r,-){n+e))'^) 

Then since < e and + — x)^) is an increasing 

function 



P{r,-){n+e) < e(l + (1 - e)2) = ei 



(24) 



P{h,,m)in+, + 1) < ei(l + (l-ei)2) ^£2, (25) 

and we have chose e = 0.5. We now consider the evolution 
of occupancy probability from 6 ^ q. From ( fT9l l we have 

1 -P{h.,,n){j + 1) = 

^-P{h.,m){j) - P(h,,rn){j){^-P{h,,m){j)) 

X (1 - a -p(,j^^,„)(m) +P{h,,rn){j + 1)) 
< (1 -P(/...m)(j)) (1 - ^ (1 - £2 - P(h,,,„)(m) + S)) , (26) 

where we have used a = ,„)(n+e + l) < €2- Now, suppose 
that P(;i,.m)('Ti) < g. Then from ( |26] | we have 

1 - P{K,m){i + 1) < (1 - P(/>,,m)0')) (1 - (5 (1 - 62 - q + 5)) 



In particular, by recursion on the above starting at 

1 - 9 < 1 -P{h,.m){m) 

< (1 - (1 - <5 (1 - £2 - g + 5))™-"+^ 

< (1 - 5) (1- ,5 (1-62-9 + 5))™-" + ^ 

We can rewrite the above in the following manner 

(m - n+s) log (1 - 5 (1 - £2 - (? + 5)) > log ^— |. 

Now, by assumption S < q so the right side of the above is 
negative. If we can find < 5 < q for which the left side is 
negative, we can obtain an upper bound on In other words, 
we search for 6 that satisfies 



0< (l-e2-g + <5) < 1 
- q + €2 - 1 < S < q + €2, 



(27) 



where from ( |25] | we have 62 < 0.72. Also, since e = 0.5, we 
have e + q > 1. Thus, sufficient conditions on S are 

S > q- 0.28 > g + 62 - 1 and 
S < I < q + e2. 

For example, we could choose S ~ 0.8, with q > 0.8. Thus, 



TO < + 



1 



■log 



1 



1-9 



(28) 



log i-S{l-e2-q+S) 

Finally, from the three bounds ( fTSl ), ( |23] l and ( [28] l we have a 
contradiction of the assumption on the size of buffer Hence, 
we must have P(h,,m){^) > Q, which yields the proof. ■ 

VI. Simulations 

In this section, we use simulations to further evaluate the 
performance of different chunk-selection policies. We first 
consider a network with a fixed number of active peers. 
Specifically, the network consists of one server and 10,000 
peers. Each peer has a buffer of size to. The network is a 
slotted time system. During each time slot, the server randomly 
selects a peer and uploads a new chunk to it, and each peer 
(expect the one who obtains the new chunk from the server) 
randomly selects another peer and downloads a chunk selected 
according to the chunk-selection policy. Figure |2] shows the 
minimum buffer sizes for attaining target skip-free playout 
probabilities under the greedy, rarest-first and hybrid polices. 
We see that the buffer size required under the hybrid policy is 
substantially smaller than the rarest-first and greedy policies. 
According to our simulation, the greedy policy requires the 
buffer size to be 183 to attain skip-free playout probability 
0.976, the rarest-first requires the buffer-size to be 166 to attain 
skip-free playout probability 0.996, and the hybrid policy 
only requires a buffer size of 40 to attain skip-free playout 
probability 0.999! 

In reality, the number of active peers in a P2P network 
changes over time because peers can dynamically join and 
leave. To study the performance of different polices with peer 
arrivals/departures, we consider a network with 20, 000 peers. 
. Initially, 10, 000 peers are active. At each iteration, an active 
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Fig. 2. The minimum buffer size versus target skip-free playout probability 
with a fixed number of peers in the system. The hybrid policy performance 
is an order better than either greedy or rarest first. 

peer becomes inactive with probability 0.001, and an inactive 
peer becomes active with probabiHty 0.001. 

In our simulation, a peer empties its buffer when it is 
inactive, and begins to play the video m time slots (start- 
up latency) after it becomes active, where m is the buffer 
size. Figure [3] shows the minimum buffer sizes for attaining 
target skip-free playout probabilities under the rarest-first and 
hybrid polices. The skip-free probability is computed based 
on peers who are playing the video (not including peers who 
are in their start-up phase). The greedy policy is not included 
because its performance is really poor; for example, the skip- 
free playout probability of the greedy policy with buffer size 
200 is still less than 0.90. From Figure [3] we see that the 
buffer size required under the hybrid policy is much smaller 
than the rarest-first policy. In fact, the rarest-first requires the 
buffer-size to be 125 to attain skip-free playout probability 
0.99, whereas the hybrid policy only requires the buffer size 
to be 39.' This simulation indicates that the hybrid policy works 
well even in P2P networks with peer arrivals/departures. 
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Fig. 3. The minimum buffer size versus skip-free playout probability with 
peer arrivals and departures. Greedy is not shown due its extremely poor 
performance. The hybrid policy easily does an order better than rarest-first. 

VII. Conclusion 

In this paper we considered the problem of designing 
efficient policies for real-time streaming applications using 
P2P approaches. Our objective was to ensure that the playout 
buffer for a given target of skip free playout remains as small 
as possible. We showed that both the rarest-first and greedy 



policy have similar buffer scalings, and that their combination 
into a hybrid policy yielded order sense improvements in the 
required buffer size. Further, the buffer size required by the 
hybrid policy is close to the minimum over all policies. Future 
work includes the design of policies which are designed for 
ensuring good QoS for specific video codec formats. 
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Appendix 

Lemma 5: Consider the hybrid policy h^. For buffer space 



such that i > n 



the steady state probabilities of 



occupancies satisfy 

P(h,,,n){j + 1) = 
P{h,,rn) 

X (l - a -P(/u,m)(»7t) +P(/i,,r„)(j 



where a = P(r,-)("-(r,-).+e + !)■ 

Proof: Recall that the hybrid policy first selects a thresh- 
old buffer position where the steady state occupancy probabil- 
ity using the rarest-first policy is greater than e (this position 
is called gives priority to the rarest-first algorithm, 

and uses the greedy algorithm if the difference set with the 
selected peer contains none of the chunks indexed from 1 to 
n-ir. ■),+£■ We first find the probability that the greedy algorithm 
is used. Let 1 — ^ ~ jj- According to the definition of the 
hybrid poUcy, we have for n(^r.-),+e > 1: 

W{h,,m) [a : . . . , n(^r.),+e} ^ 9j 

= 7 n {'^ - P{h,.rn)ij) - P{h„m)i.i))) 

= 7 n {'^ - Pir,m){j) ~ P{r,m){j))) 
= S(r,-)("-(r-r),+e + 1) 

= 1 — a. 

So 1— a is the probability that none of the chunks indexed from 
1 to ).+e is in the difference set. When this event happens, 
the hybrid policy uses the greedy policy, and the proof next 
is similar to the proof of Proposition 1 of [19]. Following ([TJ 
and (HI, the steady-state probabilities of occupancies can be 
written as 

P(h,_,7n){i + 1) 

= P(h,,m){i) + P(h,,m){i){'^ - P[h,,rn){i)) X 

m—1 

(l-a) (1 -P(/i„m)(j) (1 -P(h.,Tn)(j))) • 

j=i+l 

Defining s(i) = Iljl'^ii (l - (i) (l - P(/i„m) (j))) 

for z < m — 1 and s(rn — 1) = 1, the equation above can 
be written as 

P{h„rn){-i + 1) = 

(i) + (1 - a)s(i)p(^^.„)(i)(l ~ P(h,.7n){i)) (29) 

and 

s(i) 

,. , ,^ = 1 -P(/i,,m)(« + -Pih,,m){i + 1)), 

S[l + 1) \ ^' / \ 5. y 

which implies that 

s{i + 1) - s{i) = s{i + l)pi^h,,m}{'i + -P{h,.m}{-i + !))■ 
Substituting the equality above into equality ( |29] l, we obtain 

P(/i,,m)(i + 2) -P{h,,m){-i + 1) = (1 - a)(s(i + 1) - s{i)). 

Summing up the inequalities above from i to 7ti — 2, we 
have 

P(h,,rn){in) -P(h,,rn){i + 1) = (1 " a){s{m - 1) - s{i)). 
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Note that s{rn — 1) = 1, so 

(1 - a)s(i) = 1 - a - P(h„7n) (m) + P(h„7n) + 1), 
and the lemma holds. ■ 



